The origin of degree correlations in the Internet and other networks 
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It has been argued that the observed anticorrelation between the degrees of adjacent vertices 
in the network representation of the Internet has its origin in the restriction that no two vertices 
have more than one edge connecting them. Here we introduce a formalism for modeling ensembles 
of graphs with single edges only and derive values for the exponents and correlation coefficients 
characterizing them. Our results confirm that the conjectured mechanism does indeed give rise to 
correlations of the kind seen in the Internet, although only a part of the measured correlation can 
be accounted for in this way. 



I. INTRODUCTION 

The statistical properties of networks have been the 
topic of considerable attention in the physics literature 
in recent years 0, 0, 0, B • Motivated by the availability 
of large-scale structural data for networks including the 
Internet, the World Wide Web, and social and biological 
networks of various kinds, researchers have created a wide 
selection of models of networks and processes taking place 
on networks. One topic of particular current interest is 
the issue of degree correlations in networks. A network 
or graph is in general composed of some set of nodes or 
"vertices" joined together by lines or "edges," and the 
degree of a vertex is defined to be the number of edges 
connected to the vertex. It has been found that for many 
real-world networks the degrees of the vertices at either 
end of an edge are not independent, but are correlated 
with one another, either positively or negatively [EIS0- 
A network in which the degrees of adjacent vertices are 
positively correlated is said to show assortative mixing by 
degree, whereas a network in which they are negatively 
correlated is said to show disassortative mixing. A strik- 
ing pattern that emerges when networks of different types 
are compared is that most social networks appear to be 
assortatively mixed, whereas most technological and bi- 
ological networks appear to be disassortative 0, @ • 

Of particular interest to us in this paper is the Inter- 
net. At the time of writing, the Internet forms a network 
of about 11000 vertices and 32 000 edges, and, as first 
pointed out by Pastor-Satorras et al. [5|, the degrees of 
adjacent vertices have significant anticorrelation. This is 
demonstrated by calculating the mean degree fc" n of the 
neighbors of a vertex v in the network as a function of 
the degree k v of that vertex. The resulting function is 
found to fall off with increasing k v roughly as a power- 
law k~ v with exponent v ~ 0.5, so that the higher the 
degree k v of the one vertex, the lower the mean degree 
of its neighbors. 

In a recent paper, Maslov et al. |fj] have proposed a pos- 
sible explanation for this result. Rather than supposing 
the anticorrelation of vertex degrees to be the result of 
some specific social or engineering constraints on the con- 
struction of data networks, they suggest instead a topo- 



logical explanation. Using computer simulations, they 
show for a network of the size and degree sequence of the 
Internet that the requirement that there is at most one 
edge between any pair of vertex induces degree anticorre- 
lations very similar to those observed. And indeed there 
are no double edges in the Internet, a statistically un- 
likely occurrence were we given complete freedom about 
how vertices were connected. 

The physical intuition behind the suggestion of 
Maslov et al. is that the restriction to single edges causes 
high-degree vertices to have fewer connections between 
them than they would if edges were assigned purely at 
random, and hence there must be more connections be- 
tween high-degree/low-degree vertex pairs instead. A 
similar explanation could apply in the case of other types 
of networks as well, such as directed networks. The 
World Wide Web and foodwebs are two examples of di- 
rected networks that appear to be disassortative and usu- 
ally have no double edges |2lj . 

In this paper we study the mechanism proposed by 
Maslov et al. analytically, and demonstrate that it does 
indeed produce disassortative mixing by degree of pre- 
cisely the type observed by Pastor-Satorras et al. p| . The 
particular model chosen by Maslov et al. to test their idea 
turns out to be difficult to treat analytically. They stud- 
ied the ensemble of all graphs with a particular degree 
sequence and at most one edge between any vertex pair, 
in which each allowed graph appears with equal proba- 
bility. Calculating the correlations in this ensemble re- 
quires us to enumerate binary matrices with given row 
and column sums. No closed-form solution for such an 
enumeration is known at present despite decades of study 
by mathematicians 0, 101. In this paper, therefore, we 
take a different approach, borrowing a trick from statisti- 
cal mechanics. We study an expanded "grand canonical" 
ensemble of graphs in which the number of edges is al- 
lowed to vary under the action of a chemical potential. 
As network size becomes large, the number of edges be- 
comes narrowly peaked and the predictions of the model 
become similar to those of the model of Maslov et al., 
while the calculations are far easier. (A grand canoni- 
cal ensemble of graphs has also been studied recently by 
Dorogovtsev et al. [Tj , although using a different formal- 
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ism and to a different purpose.) 

For networks with power-law degree distributions, we 
will show that indeed fc" n falls off as a power of k v and 
derive the value of the exponent v. We also calculate 
the value of the degree correlation coefficient for adja- 
cent vertices, which measures the amount of disassorta- 
tive mixing in the network. We show that the mecha- 
nism of Maslov et al. can account for some, but not all, 
of the disassortativity seen in the Internet, suggesting 
that there are also other mechanisms contributing to the 
observed degree correlations. 



II. DEFINITIONS 

The classic model in the study of graphs with ar- 
bitrary degree sequences is the so-called configuration 
model 0, 03, El 0, El El 5 m which one specifies the 
degree k v of each vertex v = 1 ... n in a network, which 
also fixes the total number of edges to be m = | J2 V ^v- 
Subject to the given degree sequence, the vertices are ran- 
domly wired to one another. The combinatorics of this 
model are however awkward and so Chung and Lu [lTj 
have recently proposed an alternative model that is in 
many ways more convenient. (Models similar to that of 
Chung and Lu have also been introduced independently 
by Dorogovtsev et al. and Caldarelli et al. As 
we will show, by making use of an extension of their 
model we can make tractable the problem of counting 
graphs with single edges only. The model of Chung and 
Lu deals with undirected networks, and we consider that 
case first. A fairly straightforward generalization to di- 
rected networks will be dealt with briefly. 



A. The network model of Chung and Lu 

In the model of Chung and Lu 0] one specifies the 
desired degrees k v of vertices v and then places edges 
between vertex pairs (v,w) with probability 



fvw — 



k k 

2m ' 



(1) 



where m = is the desired number of edges in the 

graph. The expected degree of vertex v is then 



(2) 



Thus the expected degree of each vertex is equal to its 
desired degree and the expected degree distribution is 
asymptotically equal to the distribution of the desired de- 
gree sequence, although any individual vertex may have 
a degree that differs from its desired value. (Throughout 
this paper, we denote desired values of quantities by a 
tilde (e.g., fc), expected values or ensemble means by a 
bar (e.g., fc), and actual values in a particular graph by 
undecorated characters (e.g., k).) 



However, this approach is not entirely satisfactory. For 
some degree distributions the probability f vw can exceed 
one. Physically, this means that there can be more than 
one edge between a pair of vertices, precisely the situ- 
ation that we will want to exclude in our calculations. 
Chung and Lu circumvent this problem by specifying an 
additional constraint on the distribution of desired de- 
grees, k v < \/2rh for all v. While this condition ensures 
that f vw < 1, it is strongly violated by networks like the 
Internet that have power-law degree distributions. Here, 
therefore, we adopt an alternative strategy, and adapt 
the model of Chung and Lu to incorporate an explicit 
condition that there is only one edge between every ver- 
tex pair. As we will see, this leads to some interesting 
new physics, and in particular to an explanation of the 
origin of disassortativity. 



B. Ensemble of networks with single edges 

We consider explicitly an ensemble of networks in 
which there is only a single edge between any pair of ver- 
tices. There will be an edge between the pair (i>, w) with 
probability f vw or not with probability 1 — Then 
the probability of occurrence of a particular graph G can 
be written 



r(G) = n (i - f vw ) n - 



fvu 



(3) 



(v,w) 



edges 



where the first product is over all unique vertex pairs 
(v,w) and the second is over only those pairs between 
which there is an edge. For convenience, we will write 
Pvw = fmv/(l - fvw), r = ]l( u ,t )( 1 ~ fvw), and define 
S vw to be 1 if there is an edge between v and w and zero 
otherwise. Then 



T(G) = r [] P, 



6 VW 
vw 



(4) 



(v,w) 



To progress, we need to choose a form for P vw , or 
equivalently for . We here make the particular choice 
of the factorizable form 



Pvw — /^A^Afj 



(•5) 



where [3 is a free parameter that will control the total 
number of edges in the graph, and the fugacity A,j is a 
real number assigned to each vertex v that will control the 
expected degree of that vertex. (Note that this choice is 
not the same as that of Chung and Lu, Eq. although 
in the "classical limit" of graphs with few double edges 
it becomes the same. See below.) 

The justification for our choice of P vw is that we would 
like all graphs with a given degree sequence to appear in 
our ensemble with equal probability. This is the same 
criterion applied by Maslov et al. [|| in their simulations, 
and allows us to compare our results with theirs. To see 
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that the criterion is satisfied in this case, we need only 
observe that 



(6) 



where k v is the actual degree of v in the particular 
graph G and m = \ ^2 V k v is the actual number of edges. 
Clearly for given (3 and {A^} this expression is a function 
only of the degree sequence {k v }. 



III. PREDICTIONS AND RESULTS 



We now define a grand partition function 



z = ]Tr(G) = r £ Y[p; 



5 VW 
vw 



(7) 



G 



Interchanging the order of sum and product this gives 

z = n = n 

= U(l + f3X v X w ), (8) 

(v,w) 

where we have dropped the factor of IV (As is typically 
the case with partition functions, leading factors of this 
type cancel out of all observable quantities in the theory.) 

From Eq. JfjJ we can now see that the expected degree 
k v of vertex v will be given by 



- _X v dZ _ dF 
v Z d\ v v dX v 

where F is the free energy 



(9) 



F = -logZ = - log(l + 0X v X w ). (10) 



Combining Eqs. @ and ljTU|) . we then get 

f3X v X w 



K - ^2 — 



PX V X Z 



(ii) 



The expected number of edges m is the ensemble mean 
of the exponent of (3 in the partition function, which is 
given by 



.OF 



P—=T- 

P dB A 1 



PX V Xy 



d(3 ^l + f3X v X v 



(12) 



The mean degree of the entire system z is simply 2m/n, 
where n is the total number of vertices. 

There are clear parallels between these results and the 
familiar Fermi ensemble of elementary statistical mechan- 
ics. The quantity f vw introduced earlier, which we can 
now write in the form 



fvi: 



(3X V X v 



l + /3X v X 



(13) 



lies strictly in the range from to 1, and represents the 
probability that an edge lies between a particular pair of 
vertices. This is the equivalent of the Fermi function of 
statistical mechanics. 

The mean sum of the degrees of the neighbors of a 
vertex v, which we denote A^" n , is given by 



if" 



h — 



Si 



(3X v X h 



/3X v Xy 



(14) 



with k w given by Eq. Ijlll) . and the mean degree of a 
neighbor of v is equal to fc" n = K£ n /k v . We will also 
want to calculate the correlation coefficient of the degree 
of vertices at either end of an edge 0, whose value is 
given by 



Ev ^ 3 -(2m)- 



(15) 



Although in this paper we will be dealing primarily 
with undirected networks, generalization of the theory 
to directed networks is straightforward. If f vw denotes 
the probability of existence of a directed edge from v 
to w and P vw is defined as before, then the expected out- 
degree (number of outgoing edges) of a vertex v will be 



1 V 



Pvw 



(16) 



the expected in-degree (number of incoming edges) will 
be 



p 

- 1 ii 



Lin _ \ " _ \ " r wv 

J- I w v 



(17) 



and the obvious generalizations of Eqs. I|12fl and l|14|) ap- 
ply- 



A. Example: power-law degree distribution 

We are here particularly interested in the case of the 
Internet, which, like a number of other networks, has a 
degree distribution that approximately follows a power 
law 



Pk oc 



(18) 



with r - 2.2±0.3 [HHj. The long tail of the power law 
means that the highest degree vertex pairs in the network 
would be quite likely to have more than one edge running 
between them were edges assigned at random, and the 
behavior of the network changes substantially when these 
multiple edges are disallowed. This is the origin of the 
effects observed by Maslov et al. |(J in their simulations. 

As we now show, the power-law degree distribution can 
be reproduced in our model by choosing the fugacity A 
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also to have a power-law distribution with the same ex- 
ponent t, so that the number of vertices with fugacity 
between A and A + dA is p(X) dA, where 



c\- t 





for A > Ao 
for A < Aq. 



(19) 



The lower cutoff makes the distribution normalizable, 
and C is a normalizing constant given by 



cr 1 



\- T dA 



An 



1 



(20) 



(Bear in mind that A is not restricted, as the degree is, 
to integer values.) 

Let us consider the case r = I, for which the expres- 
sions for the quantities of interest take particularly sim- 
ple forms. For this choice, the expected degree fc(A) of a 
vertex with fugacity A is 



fc(A) = n 



0W 



7 p(A')dA' 



Ao 1 + /3AA' 

= 3n[/3A A- (/3A A) 3/2 arctan((/3A A)~ 1/2 )], (21) 
and the mean degree z of the system is 

z = — = 2 / fc(A)p(A)dA 



9n/3A z 



l-i$ 



L_ 2 i 

/3A 2 ' ' 2 



(22) 



where $(x, a, b) denotes the analytic continuation of the 
Lerch transcendent. 

The parameter (3 is to some extent redundant in these 
expressions, since we are free to choose A as we wish, 
but it proves convenient nonetheless. If we choose (3 — 
(2m) _1 , where fh is the desired number of edges as be- 
fore, then for graphs in which there are few double edges 
we have f vw ~ (X V X W ) / (2fh) , giving k v = J2w fvw — 
J2w(^vX w ) I '(2m) ~ A w , so that the fugacity is simply 
equal to the desired degree of a vertex, as in the model 
of Chung and Lu . 

The regime in which there are few double edges can be 
thought of as the "classical limit" of our Fermi ensem- 
ble, and corresponds to the case where the first terms 
in Eqs. I|21|l and l|22|) dominate. As A becomes large, 
however, encouraging vertices to have high degree, we 
enter the quantum regime, where it becomes harder and 
harder for vertices to find others to connect to. This is 
reflected in Eq. \2A\ also. Expanding the inverse tangent 

r 5 — 0(a; 7 ), we find that the 



as arctanx = x 



I T 3 
3 



leading term cancels and 



k(X) 



3n 
5/3A A 



nO((/3A A)- 



(23) 



Thus, as A — > oo the degree tends to n, as we would 
expect, since this is the largest degree a vertex can have 
on a network with no double edges. 



10000 



1000 




FIG. 1: The ensemble mean k of the degree of a vertex in 
our model as a function of the fugacity A of the vertex. The 
numerical results are averaged over 1000 repetitions of the 
simulation. The dotted line indicates the form k = A, which 
the curve is expected to approximate for small A. 



The mean sum K nn (A) of the degrees of the neighbors 
of a vertex with fugacity A is 



K nn {X) = n 



/3XX' 



Ao 1 + /3AA 
9n 2 (/3A A) 3/2 

3/2 



7 k(X')p(X')dX' 
^arctan((/3A A)- 1 / 2 ) 

A 



1/2) 



j(^) [21og(l + (A/A ) 



log(l + /3A A) + O(A ^)] 



. (24) 



and from this we can calculate k nn 22] . 

These results can be extended to other values of r also, 
although the formulas are not as elegant as for the case 
r = I. For example, for general r > 1 the equivalent of 
Eq. GO! is 



k(X)=n 2 F 1 1,-1 + r;r; 



1 



I3X XJ' 



(25) 



where 2^1 is a hypergeometric function. This form is 
used in some of the calculations in the next section. 



B. Comparison with the Internet 

We now compare our model quantitatively with the 
Internet graph. To do this, it is important that we make 
the size n and number of edges fh the same as the real 
Internet, since our predictions, Eqs. (|21|l and l|24|) . are 
dependent on these quantities. For the purposes of com- 
parison, we use the data of Chen et al. 20j from 2001 
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1 10 100 1000 10000 

degree k 

FIG. 2: The cumulative distribution function for vertex de- 
gree in simulations of our model. The general form of the 
distribution is a power law for low degree with a cutoff as 
degree approaches the system size n. 



on the structure of the Internet at the autonomous sys- 
tem level, for which n — 10 697 and m = 31992, which 
gives a mean degree of z = 2rh/n = 5.98149. For the 
choice Eq. (|19|) of fugacity distribution used here, we can 
arrange for the network to have the correct mean de- 
gree by an appropriate choice of the lower limit Ao of 
the distribution, and we do this for three values r = 2.1, 
2.3, and 2.5 of the exponent of the power-law. We also 
perform extensive simulations of the model for the same 
parameter values to confirm our calculations, and ana- 
lytic and numerical results are shown in Figs. ^ 121 and [3] 
and in Table [fl As we can see, analytic and numerical 
predictions agree closely. 

Consider first Fig. ^ which is a plot of the mean de- 
gree of a vertex as a function of its fugacity. As the 
figure shows, the degree is closely linear in the fugacity 
for small A and flattens off as degree approaches n, as 
expected. 

The same behavior is evident in Fig. [21 also, which 
shows the cumulative distribution function of degrees in 
simulations of the model for power-law distributed fugac- 
ity, Eq. (|19|) . The distribution of degrees also follows a 
power law (a straight line on the logarithmic axes used), 
until degree approaches n, where the distribution is cut 
off. This is eminently sensible behavior: given the con- 
straint of single edges only, presumably the real Internet 
must deviate from power-law behavior for large degree, 
and our model should and does reflect this behavior. 

The fundamental result of this paper is shown in Fig. 01 
where we have plotted the mean degree fc nn of the neigh- 
bors of a vertex, calculated from Eq. 124|) . against the 
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degree 

FIG. 3: The mean degree k nn of the neighbors of a vertex as 
a function of the degree k of that vertex. The dotted lines 
show the asymptotic slopes of the curves. 



degree of that vertex. This is the comparison used by 
Pastor-Satorras et al. |5j to demonstrate degree anticorre- 
lation in the Internet. As the figure shows, there is a clear 
decline in the value of fc nn as degree increases, just as in 
the real Internet, confirming that the single-edge con- 
straint does indeed give rise to anticorrclations, as con- 
jectured by Maslov et al. Furthermore, the decline ap- 
pears to be approximately power-law in form fc nn ~ k~ v ', 
as found by Pastor-Satorras et al. We can deduce ap- 
proximate values for the exponent v from our results. 
We find for r = 2.1, v ~ 0.65, for r = 2.3, v ~ 0.55, 
and for r = 2.5, v ~ 0.42. The slopes are shown as the 
dotted lines in Fig. (J2J. The values for v are all close 
to the value v ~ 0.5 observed for the real Internet 
The power law is only approximate however — the func- 
tional form of Eq. (|24|l is not just a simple power law, 
and we can see from the figure that the slope of k nn is 
smaller for smaller k. The same behavior is visible in 
both the real Internet data and the simulation results of 
Maslov et al. @- 

Finally, in Table [Q we show values for the mean de- 
gree z and degree correlation coefficient r for our model. 
As we see, the theoretical calculations and numerical re- 
sults again agree well. Since the Internet is disassortative, 
we expect the degree correlation coefficient to be negative 
in the real network, and its value has been measured to 
be r = —0.189 7]. In the model we also see negative val- 
ues of r, whose magnitude depends quite strongly on the 
value of the exponent r. A detailed comparison of model 
and real-world data may therefore have to wait on more 
precise measurements of the degree distribution (about 
which there is at present some dispute |20|'). However it 
is interesting to note that none of the cases in Table |U 
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mean 


degree z 


degree 


correlation r 


T 


theory 


simulation 


theory 


simulation 


2.1 


5.981 


5.982(15) 


-0.0950 


-0.0932(17) 


2.3 


5.981 


5.972(9) 


-0.0541 


-0.0551(18) 


2.5 


5.981 


5.986(7) 


-0.0304 


-0.0321(14) 



TABLE I: Mean degree and degree correlation coefficient for 
the networks generated by our model from both the analytic 
theory and from computer simulations. The simulation re- 
sults are averaged over 1000 networks each. Figures in paren- 
theses show statistical errors on the least significant figures. 



is as strongly anticorrelated as the real Internet. Thus 
our calculations appear to indicate that some of the dis- 
assortativity in the Internet can be accounted for by the 
mechanism proposed by Maslov et ai, but probably not 
all of it. The remainder of the disassortativity is pre- 
sumably due to engineering or social constraints on the 
structure of the network. One possibility, which has been 
discussed elsewhere 0, , is that the Internet is divided 
into connectivity providers such as phone companies and 
ISPs, who tend to have very high degree, since they have 
lots of customers, and end users of connectivity, who typ- 
ically have a degree of only one or two. Most connections 
in the network run from the providers to the end users 
and are therefore from high to low degree, giving a so- 
cial reason for disassortativity in addition to the purely 
topological one considered here. 

IV. DISCUSSION AND CONCLUSIONS 

In this paper we have studied analytically ensembles 
of networks where there is at most one edge between any 
pair of vertices. By making use of an enlarged ensem- 
ble in which the number of edges is allowed to vary, in a 
manner reminiscent of the Fermi ensemble of traditional 
statistical mechanics, we have been able to find closed 
form expressions for ensemble averages of a number of 
quantities of interest. In particular, we have confirmed 
the previous numerical finding [6| that graph ensembles 
with single edges have negative correlations between the 
degrees of adjacent vertices. This has been proposed as 
a possible explanation for the anticorrelation or disassor- 
tativity observed in the topology of the Internet Q . We 
find that the restriction to single edges only can account 
for some but not all of the correlations observed in real 



Internet data. 

The same mechanism could be responsible for disas- 
sortativity in other networks also. Many networks, in- 
cluding citation networks, the World Wide Web, social 
networks, collaboration networks, metabolic and genetic 
regulatory networks, and food webs have, at least in their 
most common representations, only single edges between 
vertex pairs. Thus it is reasonable to suppose that these 
networks would be disassortative also, and indeed this 
appears to be the case for most networks that have been 
studied Q . There is one important exception to this rule 
however: social networks almost all appear to be signif- 
icantly assortative in their mixing patterns. We conjec- 
ture, therefore, that disassortativity by degree is the nor- 
mal state of affairs for a network, as a result of the mecha- 
nisms described in this paper, with social networks being 
assortative probably because of additional social effects 
that are absent from other network types; for one rea- 
son or another, it appears that gregarious people prefer 
to associate with other gregarious people. Furthermore, 
when assessing the level of assortativity in a social net- 
work, one should take into account the natural tendency 
for networks to be disassortative, since this tendency im- 
plies that to reach a level even of neutral assortativity 
would take a moderately strong bias in favor of positive 
degree correlation, and reaching a substantially assorta- 
tive state would take a very strong such bias. 

Finally, we point out that the general analytical tech- 
nique employed in this paper, of enlarging an ensemble of 
graphs to create a "grand canonical" ensemble, may have 
applications to other problems in the study of networks 
also. It is well known among statistical physicists that 
using such an ensemble often makes the analytic treat- 
ment of a problem easier, and the results presented here 
offer hope that this approach may prove useful in other 
settings. 
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